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LARGE GRAPH LIMIT FOR AN SIR PROCESS IN RANDOM 
"CS ■ NETWORK WITH HETEROGENEOUS CONNECTIVITY 
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^ I Telecom Paristech, Universite Paris 13, Universite de Technologie de 

^ ■ Compiegne, and Universite des Sciences et Technologies Lille 1 and Ecole 

QQ \ Polytechnique 

^ We consider an SIR epidemic model propagating on a configura- 

^^, ' tion model network, where the degree distribution of the vertices is 

f^ , , given and where the edges are randomly matched. The evolution of 

• ■ the epidemic is summed up into three measure- valued equations that 

'■pi I describe the degrees of the susceptible individuals and the number of 

C^ ■ edges from an infectious or removed individual to the set of suscep- 

G ' tibles. These three degree distributions are sufficient to describe the 

' '. course of the disease. The limit in large population is investigated. As 

a corollary, this provides a rigorous proof of the equations obtained 

'nI" I by Volz [Mathematical Biology 56 (2008) 293-310]. 

oo. 

pD \ 1. Introduction and notation. In this work, we investigate an epidemic 

^^ ■ spreading on a random graph with fixed degree distribution and evolving 

I — I . according to an SIR model as follows. Every individual not yet infected is 

f^ I assumed to be susceptible. Infected individuals stay infected during random 

^^ ■ exponential times with mean 1//3 during which they infect each of their sus- 

. . , ceptible neighbors with rate r. At the end of the infectious period, the indi- 

, ^ I vidual becomes removed and is no longer susceptible to the disease. Contrary 

^ ■ to the classical mixing compartmental SIR epidemic models (e.g., [5, 17] and 

H . see [2], Chapter 2, for a presentation), heterogeneity in the number of con- 

■ - - ■ tacts makes it difficult to describe the dynamical behavior of the epidemic. 



Received October 2010; revised February 2011. 

^Supported by ANR Viroscopy (ANR-08-SYSC-016-03) and MANEGE (ANR-09- 
BLAN-0215), and from the "Chair Modelisation Mathematique de la Biodiversite of Veolia 
Environnement-Ecole Polytechnique-Museum National d'Histoire Naturelle-Fondation X." 

AMS 2000 subject classifications. 60J80, 05C80, 92D30, 60F99. 

Key words and phrases. Configuration model graph, SIR model, mathematical model 
for epidemiology, measure-valued process, large network limit. 



This is an electronic reprint of the original article published by the 
Institute of Mathematical Statistics in The Annals of Applied Probability, 
2012, Vol. 22, No. 2, 541-575. This reprint differs from the original in pagination 
and typographic detail. 

i 



2 DECREUSEFOND, DHERSIN, MOYAL AND TRAN 

Mean field approximations (e.g., [4, 10, 23]) or large population approxima- 
tions (e.g., [3], see also equation (3) of [1] in discrete time) provide a set of 
denumerable equations to describe our system. We are here inspired by the 
paper of Volz [27] , who proposes a low-dimensional system of five differential 
equations for the dynamics of an SIR model on a configuration model (CM) 
graph [7, 19]. We refer to Volz's article for a bibliography about SIR models 
on graphs (see also Newman [20, 21], Durrett [10] or Barthelemy et al. [4]). 
Starting from a random model in finite population, Volz derives determinis- 
tic equations by increasing the size of the network, following in this respect 
works of Newman, for instance, [21]. The convergence of the continuous- 
time stochastic SIR model to its deterministic limit for large graphs was, 
however, not proved. In this paper, we prove the convergence that was left 
open by Volz. To achieve this, we provide a rigorous individual-based de- 
scription of the epidemic on a random graph. Three degree distributions are 
sufficient to describe the epidemic dynamics. We describe these distributions 
by equations in the space of measures on the set of nonnegative integers, of 
which Volz's equations are a by-product. Starting with a node-centered de- 
scription, we show that the individual dimension is lost in the large graph 
limit. Our construction heavily relies on the choice of a CM for the graph 
underlying the epidemic, which was also made in [27]. 

The size N of the population is fixed. The individuals are related through 
a random network and are represented by the vertices of an undirected 
graph. Between two neighbors, we place an edge. The graph is nonoriented 
and an edge between x and y can be seen as two directed edges, one from x 
to y and the other from y to x. If we consider an edge as emanating from 
the vertex x and directed to the vertex y, we call x the ego of the edge 
and y the alter. The number of neighbors of a given individual is the degree 
of the associated vertex. The degree of x is denoted d^. It varies from an 
individual to another one. The CM developed in Section 2.1 is a random 
graph where individuals' degrees are independent random variables with 
same distribution {pk)keN- Edges are paired at random. As a consequence, 
for a given edge, alter has the size-biased degree distribution: the probability 
that her degree is k is kpk/Yli&i^P^- 

The population is partitioned into the classes of susceptible, infectious or 
removed individuals. At time t, we denote by St, it and Rt the set of suscep- 
tible, infectious and removed nodes. We denote by St, h and Rt the sizes of 
these classes at time t. With a slight abuse, we will say that a susceptible 
individual is of type S (resp., of type i or r) and that an edge linking an 
infectious ego and susceptible alter is of type IS (resp., RS, ii or ir). For 
X € I (resp., r), dx{s) represents the number of edges with x as ego and 
susceptible alter. The numbers of edges with susceptible ego (resp., of edges 
of types IS and RS) are denoted by A^t^ (resp., N]^ and Nf^). 
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A possible way to rigorously describe the epidemics' evolution is given in 
Section 2.2. We consider the subgraph of infectious and removed individuals 
with their degrees. Upon infection, the infectious ego chooses the edge of 
a susceptible alter at random. Hence, the latter individual is chosen pro- 
portionally to her degree. When she is connected, she uncovers the edges to 
neighbors that were already in the subgraph. 

We denote by N the set of nonnegative integers and by N* = N \ {0}. 
The space of real bounded functions on N is denoted by -Bb(N). For any 
/ € BbCN), set ||/||cxD the supremum of / on N. For all such / and y € N, we 
denote by Tyf the function x i-^ f{x — y). For all n S N, x" is the function 
X I— >■ x", and in particular, x = X^ is the identity function and l = x^ is the 
function constantly equal to 1. 

We denote by A4i;'(N) the set of finite measures on N, embedded with the 
topology of weak convergence. For all fi € A^i7'(N) and / E Bb(N), we write 

{^^J) = Y.f{k)^^{{k}). 

ken 

With some abuse of notation, for all fi € A4ir(N) and A; G N, we denote 
^(A;) = fi{{k}). For x G N, we write 5x for the Dirac measure at point x. 
Note, that some additional notation is provided in the Appendix, together 
with several topological results, that will be used in the sequel. 

The plan of the paper and the main results are described below. In Sec- 
tion 2, we describe the mechanisms underlying the propagation of the epi- 
demic on the CM graph. To describe the course of the epidemic, rather than 
the sizes St, It and Rt, we consider three degree distributions given as point 
measures of A^ir(N), for t > 0: 



fJ'l = Y.^d,^ ^if = J2^d,{St), 



xSSt x&t 

(1-1) 

xGRt 

Notice that the measures /J-t/St, lA^ /It and nf^/Rt are probability measures 
that correspond to the usual (probability) degree distribution. The degree 
distribution ^f of susceptible individuals is needed to describe the degrees 
of the new infected individuals. The measure ^p provides information on 
the number of edges from ij to St, through which the disease can propagate. 
Similarly, the measure //f ^ is used to describe the evolution of the set of 
edges hnking St to Rf. We can see that Nf = (//f ,x) and St = (/xf , 1) (and, 
accordingly, for Nj^, N^^, It and Rt). 

In Section 3, we study the large graph limit obtained when the num- 
ber of vertices tends to infinity, the degree distribution being unchanged. 
The degree distributions mentioned above can then be approximated, after 
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proper scaling, by the solution {p,^ , p}f , flf^)t>o of the system of deterministic 
measure-valued equations (1.3)-(1.5) with initial conditions /2q, /2p and fi^^. 
For all t > 0, we denote by N^ = (/if , x) (resp., N]:^ = {fl]^,x) and Nj^^ = 
{fi^^,x)) the continuous number of edges with ego in s (resp., IS edges, RS 
edges). Following Volz [27], pertinent quantities are the proportions p] = 
Nl^/N^ [resp., pf = N]^^/N^ and pi = (iVf - iVf - N^)/N^] of edges with 
infectious (resp., removed, susceptible) alter among those having susceptible 
ego. We also introduce 

(1.2) et = exp(-rj plds\ 

the probability that a degree one node remains susceptible until time t. For 
any/G^b(N), 

(1.3) {plf) = Y,-^l{k)e^f{k), 

fceN 

ft 

-JS f\ _ /-IS ^\ / Q/-IS 



(/-4^/) = (/io J)- / msj)ds 

Jo 



+ 1 ^rkpl Y^ 






k-\ 



[p\ym\pir 



j+£+m=k-l 

(1.4) xf{m)pl{k)ds 



I Y,rkp\{l + {k-l)pl) 



X E(/(^'-l)-/(^'))^'%F/^'(^')d«' 



fc'eN* 
Jo 



k'pfjk') 



(1.5) + fY.rkp\{k-l)p^ Yl ifik'-l)-f{k')) 

A;'/if(A;')-s.M^ 

We denote by St (resp.. It and Rt) the mass of the measure pt (resp., /2p 
and pf^)- As for the finite graph, pi /St (resp., p]^/It and pf^/Rt) is the 
probability degree distribution of the susceptible individuals (resp., the prob- 
ability distribution of the degrees of the infectious and removed individuals 
toward the susceptible ones). 
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Let us give a heuristic explanation of equations (1.3)-(1.5). Remark that 
the graph in the hmit is infinite. The probabihty that an individual of de- 
gree k has been infected by none of her k edges is 6^ and equation (1.3) 
follows. In (1.4), the first integral corresponds to infectious individuals be- 
ing removed. In the second integral, rkp]. is the rate of infection of a given 
susceptible individual of degree k. Once she gets infected, the multinomial 
term determines the number of edges connected to susceptible, infectious 
and removed neighbors. Multi-edges do not occur. Each infectious neigh- 
bor has a degree chosen in the size-biased distribution k'p}^{k')/N^^ and 
the number of edges to Sf is reduced by 1. This explains the third integral. 
Similar arguments hold for (1.5). 

Choosing f{k) = li{k), we obtain the following countable system of ordi- 
nary differential equations (ODEs): 

fili^ = film, 

fifi^=fio(i)+ / jrp^. J](i + j + £+l)/if(i + j+^ + l) 

(1-6) + {riplfifllx^ -X)+ rplifllx)) 

^,(. + l)/2-(^ + l)-z;.fW_^-.(a,,^ 



{fif,x) 



/^r(0=4^(^)+ «(^) 



/{' 



-I/-S 2 v-R(^ + l)Ai^^(^ + l)-^/"r(01 1 

+ rplinlx' - X)pr- ()zHS,^) j'^'- 

It is noteworthy to say that this system is similar but not identical to that 
in Ball and Neal [3]. Our equations differ since our mechanism is not the 
same (compare Section 2.2 with Section 5 in [3]). We emphasize that the 
number of links of an individual to S decreases as the epidemic progresses, 
which modifies her infectivity. 

The system (1.3)-(1.5) allows us to recover the equations proposed by 
Volz [27], Table 3, page 297. More precisely, the dynamics of the epidemic is 
obtained by solving the following closed system of four ODEs, referred to as 
Volz's equations in the sequel. The latter are obtained directly from (1.3)- 
(1.5) and the definitions of St, It, p\ and p^ which relate these quantities to 
the measures /if and /i^. Let 

(1.7) g{z) = Y,m)z' 
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be the generating function for the initial degree distribution of the suscepti- 
ble individuals p.Q, and let 6t = exp(— r Jg p^ds). Then, the epidemic can be 
approximated by the solution of the four following ODEs: 

(1.8) St = {fill) =g{9t), 

(1.9) It = {fif, 1) = /o + / {rpl9sg'{9s) - Ph) ds, 

JO 

(1.10) p\=pI + I^ (rplp%^^ - rplil - pl) - /5pi) ds, 

(1.11) Pl =Pl + [rp\pl (l - ^.^) d.. 

Here, the graph structure appears through the generating function g. In (1.9), 
we see that the classical contamination terms rSth (mass action) or rSth/ 
{St + h) (frequency dependence) of mixing SIR models (e.g., [2, 9]) are re- 
placed by rp\9tg'{0t) =rNp. The fact that new infectious individuals are 
chosen in the size-biased distribution is hidden in the term g" {9t) / g' {9t) ■ 

The beginning of the epidemic and computation of the reproduction num- 
ber, when the numbers of infected individuals and of contaminating edges 
are small and when Volz's deterministic approximation does not hold, makes 
the object of another study. 

2. SIR model on a configuration model graph. In this section, we intro- 
duce configuration model graphs and describe the propagation of SIR on 
such graphs. 

2.1. Configuration model graph. Graphs at large can be mathematically 
represented as matrices with integer entries: to each graph corresponds an 
adjacency matrix, the (x,y)th coefficient of which is the number of edges 
between the vertices x and y. Defining the distribution of a random graph 
thus amounts to choosing a sigma-field and a probability measure on the 
space N^' ^^* , where N* = N \ {0}. Another approach is to construct a ran- 
dom graph by modifying progressively a given graph, as in Erdos-Renyi 
model. Several other constructions are possible such as the preferential at- 
tachment model, the threshold graphs, etc. 

Here, we are interested in the configuration model (CM) proposed by 
Bohobas [7], Molloy and Reed [19] (see also [10, 21, 22, 26]) and which 
models graphs with specified degree distribution and independence between 
the degrees of neighbors. As shown by statistical tests, these models might 
be realistic in describing community networks. See, for instance, Clemengon 
et al. [8] for dealing with the spread of the HIV-AIDS disease among the 
homosexual community in Cuba. 
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We recall its construction (see, e.g., [10, 26]). Suppose we are given the 
number of vertices, N and i.i.d. random variables (r.v.) di, . . . ,d]\[ with dis- 
tribution (pfc)fcGN that represent the degrees of each vertex. To the vertex i 
are associated di half-edges. To construct an edge, one chooses two open 
half-edges uniformly at random and pair them together. 

Remark that this linkage procedure does not exclude self-loops or multiple 
edges. In the following, we are interested in a large number of nodes with 
a fixed degree distribution, hence self-loops and multiple edges become less 
and less apparent in the global picture (see, e.g., [10], Theorem 3.1.2). 

Notice that the condition for the existence with positive probability of 
a giant component is that the expectation of the size biased distribution is 
larger than 1: 

ken l^ienm 

This is connected with the fact that the Galton- Watson tree with this off- 
spring distribution is supercritical (see [10], Section 3.2, page 75, for details). 

2.2. SIR epidemic on a CM graph. We now propagate an epidemic on 
a CM graph of size A^ (see Figure 1). The disease can be transmitted from 
infectious nodes to neighboring susceptible nodes and removed nodes cannot 
be reinfected. 

Suppose that at initial time, we are given a set of susceptible and infectious 
nodes together with their degrees. The graph of relationships between the 
individuals is, in fact, irrelevant for studying the propagation of the disease. 
The minimal information consists in the sizes of the classes S, i, R and the 
number of edges to the class S for every infectious or removed node. Thus, 
each node of class S comes with a given number of half-edges of undetermined 
types; each node of class i (resp., r) comes with a number of IS (resp., RS) 
edges. The numbers of IR, ii and RR edges need not to be retained. 

The evolution of the SIR epidemic on a CM graph can be described as 
follows. To each is-type half-edge is associated an independent exponential 
clock with parameter r and to each I vertex is associated an independent 
exponential clock with parameter /3. The first of all these clocks that rings 
determines the next event. 

Case 1. If the clock that rings is associated to an i individual, the latter re- 
covers. Change her status from i to R and the type of her emanating 
half-edges accordingly: IS half-edges become RS half-edges. 
Case 2. If the clock is associated with a half is-edge, an infection occurs. 
Step 1. Match randomly the is-half-edge that has rung to a half- 
edge belonging to a susceptible. 
Step 2. This susceptible is the newly infected. Let k be her degree. 
Choose uniformly k — 1 half-edges among all the available 
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Fig. 1. Infection process. Arrows provide the infection tree. Susceptible, infectious and 
removed individuals are colored in white, grey and black, respectively, (a) The degree of 
each individual is known, and for each infectious (resp., removed) individual, we know her 
number of edges of type IS (resp., BS). (b), (c) A contaminating half-edge is chosen, and 
say that a susceptible of degree k is infected at time t. The contaminating edge is drawn 
in bold line, (d) Once the susceptible individual has been infected, we determine how many 
of her remaining arrows are linked to the classes I and R. // we denote by j and i these 



numbers, then NJ^ = N}^ 



l + {k-l)-j~e and iVf ^ = N^^ - I. 



half-edges (they either are of type IS, RS or emanate from s). 
Let m (resp., j and i) be the number of SS-type (resp., of 
IS and of RS-type) half-edges drawn among these k — \ half- 
edges. 

Step 3. The chosen half-edges of type IS and RS determine the infec- 
tious or removed neighbors of the newly infected individual. 
The remaining m edges of type SS remain open in the sense 
that the susceptible neighbor is not fixed. Change the sta- 
tus of the m (resp., j, t) SS-type (resp., is-type, RS-type) 
edges created to Si-type (resp., ii-type, Ri-type). 

Step 4. Change the status of the newly infected from s to i. 

We then wait for another clock to ring and repeat the procedure. 

We only need three descriptors of the system to obtain a Markovian evo- 
lution, namely, the three degree distributions introduced in (1.1). 

For a measure ^ G A1_f(N), we denote by Ffj,{m) = fi{{0, . . . ,m}), m G N, 
its cumulative distribution function. We introduce F~^ its right inverse (see 
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the Appendix). Then, for ah < i < ^^ (resp., < i < /^ and < i < Rt), 

l^i^^l) = Fjii) [resp., 7^(/^P) = i^ ;i(i),7^(/^f ) = Pj^ii)] 

represents the degree at t of the ith susceptible individual (resp., the number 
of edges to S of the ith. infectious individual and of the ith removed individ- 
ual) when individuals are ranked by increasing degrees (resp., by number of 
edges to s). 

Example 1. Consider, for instance, the measure /i = 26i + 36^ + 67. 
Then, the atoms 1 and 2 are at level 1, the atoms 3, 4 and 5 are at level 5 and 
the atom 6 is at level 7. We then have that 71 (/i) = F~^{1) = 1, 72(^) = 1, 
73 (^) = 74 (^) = 75 (m) = 5 and 75 (^) = 7. 

From t, and because of the properties of exponential distributions, the 
next event will take place in a time exponentially distributed with parameter 
rNp + /3/j. Let T denote the time of this event. 

Case 1. The event corresponds to a removal, that is, a node goes from 
status I to status R. Choose uniformly an integer i in I^- , then update the 
measures fj}p and ^j.^ : 

,,IS _ ,,IS e „„i ,,RS_,,RSj_r 

^r - /i^_ - d^^ (^B ) and fiT - Mt- + % if,js_ ) • 

The probability that a given integer i is drawn is 1/It- ■ 

Case 2. The event corresponds to a new infection. We choose uniformly 
a half-edge with susceptible ego, and this ego becomes infectious. The global 
rate of infection is rN^ and the probability of choosing a susceptible indi- 
vidual of degree k for the new infectious is kfij^ {k)/Nj^ . We define by 

(2.1) XTAk) = rk^ 

the rate of infection of a given susceptible of degree k at time T_. This 
notation was also used in Volz [27]. 

The newly infective may have several links with infectious or removed 
individuals. The probability, given that the degree of the individual is k and 
that j (resp., i) out of her k — 1 other half-edges (all but the contaminating 
IS edge) are chosen to be of type 11 (resp., ir), according to Step 2, is given 
by the following multivariate hyper geometric distribution: 



(2.2) pT_{j,> 



/NJj? -U fN^^-. /7V|_ -TVJ? -AfRS 



j ^ ^ l ' ^ fe-1- 



A I 

.IS n,r,A i.RS 



Finally, to update the values of /U^ and /Uy given /c, j and £, we have to 
choose the infectious and removed individuals to which the newly infectious 
is linked; some of their edges, which were IS or RS, now become 11 or Ri. 
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We draw two sequences u = {ui , . . . , uj^, ) and v = {vi,. . . , vrj, ) that will 
indicate how many links each infectious or removed individual has to the 
newly contaminated individual. There exist constraints on u and v. the 
number of edges recorded by u and v cannot exceed the number of existing 
edges. Let us define the set 

+00 
(2.3) ZY=|Jn'", 

m=l 

and for all finite integer-valued measure fi on N, and all integer n S N, we 
define the subset 

U{n,n) = lu = {ui, . . . ,n^^^i)) gU such that 
(2.4) 

Vie {l,...,{fj,,l)},Ui <F~^{i) and y^ n» = n I . 

j=i J 

Each sequence u (zU{'n,fi) provides a possible configuration of how the n 
connections of a given individual can be shared between neighbors whose 
degrees are summed up by ^. The component Wj, for 1 < i < (/i, 1), provides 
the number of edges that this individual shares with the ith individual. This 
number is necessarily smaller than the degree 7j(/i) = F~^{i) of individual i. 
Moreover, the UiS sum to n. The probabilities of the draws of u and v 
that provide, respectively, the number of edges IS which become 11 per infec- 
tious individual and the number of edges RS which become Rl per removed 
individual are given by 

p{u\j + l,^lT_) = —^ Km{3+i,f^^y 

(2.5) 

^ t > 
Then, we update the measures as follows: 

pj, = p^_ — Sk , 

It_ 



(2.6) fi's = fi'S- + ^k^i^j^i + Y. Ki^^w )-u, - ^■ 



i=l 



Rt_ 



^4' = ^4'- + Y.^A,W)-^,-^A 
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2.3. Stochastic differential equations. Here, we propose stochastic dif- 
ferential equations (SDEs) driven by Poisson point measures (PPMs) to 
describe the evohition of the degree distributions (1.1), following the in- 
spiration of [9, 13]. We consider two PPMs: dQ^{s,k,6i,j,i,92,u,6s,v,94) 
and dQ^{s, i) on M+ x Ei with ^i := N x M+ x N x N x M+ x ZY x M+ x 
U X M_|_ and ]R+ x N with intensity measures dq^{s,k,6i,j,i,62,u,63,v,94) = 
ds dn{k) (g) d9i dn{j) (g) dn(^) d92 dn(u) (g) d93 (g) dn{v) (g d04 and 
dg^(s,i) = /3ds (g dn(i), where ds, d^i, d02, d^a and d^4 are Lebesgue mea- 
sures on M+, where dn{k), dn{j), dn{i) are counting measures on N and 
where dn{u), dn{v) are counting measures on U. 

The point measure Q^ provides possible times at which an infection may 
occur. Each of its atoms is associated with a possible infection time s, an in- 
teger k which gives the degree of the susceptible being possibly infected, the 
number j + 1 and i of edges that this individual has to the sets is_ and Rs_ . 
The marks u and v aU are as in the previous section. The marks 9i, 92 
and ^3 are auxiliary variables used for the construction [see (2.8) and (2.9)]. 

The point measure Q^ gives possible removal times. To each of its atoms 
is associated a possible removal time s and the number i of the individual 
that may be removed. 

The following SDEs describe the evolution of the epidemic: for all i > 0, 



(2.7) 



I El 



^ ^92<Ps_UAk-l)h3<p{u\j+l,,,ls_)h4<p{v\e,^lfS)^Q , 



(2.8) 




f^f = l'^o+ / -5^-0+1+^) +E('^7.(MiS)-n,- -5. 



Jn 



(2.9) 



i=l / 

Ak)f,s_(k)^e2<ps^ij/\k-i)hs<p{u\j+i,f,is_) 

X hi<\s_{.k)fJ=_{k)^e2<PsAJAk~i)h-i<p(u\j+i,fif_) 
X hi<p{v\i,pf^) d<3^ 

12 



'l^{^^'t) 



)^i<h dQ^ 
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where we write dQ^ and dQ^ instead of dQ^{s,k,9i,j,£,62,u,9s,v,64) and 
dQ'^{s,i) to simplify the notation. 

Proposition 2.1. For any given initial conditions fi^, /_fQ^ and i^Lq^ 
that are integer-valued measures on N and for PPMs Q^ and Q^, there 
exists a unique strong solution to the SDEs (2.7)~(2.9) in the space 25(1^^, 
(A4i;'(N))^), the Skohorod space of cddldg functions with values in {A4f(^))'^- 

Proof. For the proof we notice that for every t € M_|_, the measure /if 
is dominated by fiQ and the measures fi]^ and fif'^ have a mass bounded 
by (a^o + A*cf + Mo^)l) ^''^d a support included in [[0,max{max(supp(//Q)), 
max(supp(/fQ^)), max(supp(//Q^))}]]. The result then follows the steps of [13] 
and [25] (Proposition 2.2.6). D 

The course of the epidemic can be deduced from (2.7), (2.8) and (2.9). 
For the sizes {St,It,Rt)teR+ of the different classes, for instance, we have 
with the choice of / = 1, that for all t > 0, St = {fJ-t, 1), It = (/^p, 1) and 
Rt = {fj.f'^,1). Writing the semimartingale decomposition that results from 
standard stochastic calculus for jump processes and SDE driven by PPMs 
(e.g., [13-15]), we obtain, for example, 

(2.10) It = (/if, 1) = Jo + / f V l^tik)\s{k) - pA ds + Ml, 

where M^ is a square-integrable martingale that can be written explicitly 
with the compensated PPMs of Q^ and Q^, and with predictable quadratic 
variation given for all i > by 

{M')t= I Y,{^4ik)\s{k) + I31s)ds. 

Another quantities of interest are the numbers of edges of the different 
types Nf , Nj^, Nj^^. The latter appear as the first moments of the mea- 
sures /if, /if and nf^ : N^ = (/if , x), A^f = (/"f , x) and iV/^^ = (/ff ^ x)- 

3. Large graph limit. Volz [27] proposed a parsimonious deterministic 
approximation to describe the epidemic dynamics when the population is 
large. However, the stochastic processes are not clearly defined and the con- 
vergence of the SDEs to the four ODEs that Volz proposes is stated but 
not proved. Using the construction that we developed in Section 2.2, we 
provide mathematical proofs of Volz's equation, starting from a finite graph 
and taking the limit when the size of the graph tends to infinity. Moreover, 
we see that the three distributions /i^, ij}^ and fi^^ are at the core of the 
problem and encapsulate the evolution of the process. 
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3.1. Law of large numbers scaling. We consider sequences of measures 
(/x"'^)„6N, (M"''^^)neN and (/u"''^^)ngN such that for any nSN*, /x"'^, //"'^^ 
and ^"'^s satisfy (2.7)-(2.9) with initial conditions //q' , //q' and //q' . We 
denote by s", i" and r" the subclasses of susceptible, infectious or removed 
individuals at time t, and by iVj ' , iVj ' and N^ ' , the number of edges 
with susceptible ego, infectious ego and susceptible alter, removed ego and 
susceptible alter. The number of vertices of each class are denoted /", S" 
and i?". The total size of the population is finite and equal to Sq + Iq + 
Rq. The size of the population and the number of edges tend to infinity 
proportionally to n. 

We scale the measures the following way. For any n > 0, we set 

(n),IS 1 n,IS 



for all t > (and accordingly, /xj" and /Uj ). Then, we denote 



and accordingly, N^''^'^, ivf )'^s, ^J") and i?|"\ 
We assume that the initial conditions satisfy: 

Assumption 3.1. The sequences {n^f^^'^)neN: (/"o"'^'^^)nGN and (/i[,")'^^)„gN 
converge to measures /Iq, /Zq^ and /Iq^ in 7V4i;'(N) embedded with the weak 
convergence topology. 

Remark 1. (1) Assumption 3.1 entails that the initial (susceptible and 
infectious) population size is of order n if p,Q and /Z^f are nontrivial. 

(2) In case the distributions underlying the measures fi^' , ^q' and //q' 
do not depend on the total number of vertices (e.g., Poisson, power-laws or 
geometric distributions). Assumption 3.1 can be viewed as a law of large 
numbers. When the distributions depend on the total number of vertices N 
(as in Erdos-Renyi graphs), there may be scalings under which Assump- 
tion 3.1 holds. For Erdos-Renyi graphs, for instance, if the probability p^r 
of connecting two vertices satisfies liniAr^+oo Npi\f = A, then we obtain in 
the limit a Poisson distribution with parameter A. 

(3) In (1.4), notice the appearance of the size biased degree distribution 
kp,^{k)/Ng . The latter reflects the fact that, in the CM, individuals having 
large degrees have higher probability to connect than individuals having 
small degrees. Thus, there is no reason why the degree distributions of the 
susceptible individuals P-q/Sq and the distribution X^fcgp}Pfc<^fc underlying 
the CM should coincide. Assumption 3.1 tells us indeed that the initial 
infectious population size is of order n. Even if Iq/Sq is very small, the 
biased distributions that appear imply that the degree distribution P^q/Iq 
should have a larger expectation than the degree distribution Pq/Sq. 
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We obtain rescaled SDEs which are the same as the SDEs (2.7)-(2.9) 
parameterized by n. For all t > 0, 

,{n),S _ ,,{n),S _ 1 /■■ '■ 

(3.1) ^ 



(n),S _ (n),S ^ X -n n 



j=l 



J^ Jo JSi 



(3.2) 



(n),RS (n),RS . ^ f f I V^/e r \ 

(3.3) 



X 1 



„,,,.,, („)JS.1„ , , |. (n),RS^ d(5 






where we denote for all s > 0, 

^n,IS 

^"(^)=^^^^ and 
(3.4) 

^n,IS_-^ ^n,RS j^n.S _j^n, IS _j^n,RS 

I fc-1 ; 

Several semimartingale decompositions will be useful in the sequel. We 
focus on ^(")'^^ but similar decompositions hold for /i^")'^ and ^(")'RS, which 
we do not detail since they can be deduced by direct adaptation of the 
following computation. 



Proposition 3.2. For all f G Sb(N), for all t > 0, 

(3.5) (/.(")'^^ /) = y: /(A^)^^'^'!^) + 4"^'^''' + m(")'^^'^ 

fceN 
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where the finite variation part A^ of {fi^ ,/) reads 

■^° km j+e+i<k 

ueu 
(3.6) X (f(k-{j + l+l)) 



j=i 



/o 

and where the martingale part M^ of {fJ.^ , /) is a square integrable 

martingale starting from with quadratic variation 



* njQ 



^ Jo trz, . , 77T^,. 



fceN j+(.+l<k 



X 



X f/(fc-(j + ! + £)) 



I" 



+ E(/(7*(^"'') - ^^) - /(7.(/^"''))) d.. 



4=1 / 

Proof. The proof proceeds from (3.2) and standard stochastic calculus 
for jump processes (see, e.g., [13]). D 

3.2. Convergence of the normalized process. We aim to study the limit 
of the system when n — )• +oo. We introduce the associated measure spaces. 
For any e > and A> 0, we define the following closed set of A4i;'(N) as 

(3.7) Me,A = {i^e Mpm {v, 1 + x"^) < ^ and (z/, x) > e} 

and Mq+^a = Ue>o-^^:-4- Topological properties of these spaces are given 
in the Appendix. 
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In the proof, we will see that the epidemic remains large provided the 
number of edges from i to S remains of the order of n. Let us thus define, 
for all e > 0, e' > and n e N*, 

(3.8) t,,:=mi{t>0,{flf,x)<e'} 
and 

(3.9) T; = inf{i>0,(MS"^'^^x)<e}. 
Our main result is the following theorem. 

Theorem 1. Suppose that Assumption 3.1 holds with 

(3.10) (4"^'',M!,"^''',/xi"^'''') in [M^^A? for any n, with {fxf ,x) > 0. 
Then, we have: 

(1) there exists a unique solution {p.^ , p}^ , jl^^) to the deterministic system 
of measure-valued equations (1.3)-(1.5) in C(]R+, A^o,A x -^o+.A x -^o.a); 

(2) when n converges to infinity, the sequence (//^""^,/i''"'^'^^,/z'"^'^^)n6N 
converges in distribution in P(]R_|_, A4q ^) to {pP , fl^^ , fi^^) . 

Proof. Uniqueness of the solution to (1.3)-(1.5) is proved in Step 2. 
For the proof of (2), since lim^i^Qti,! = +oo, it is sufficient to prove the 
results on 'D{[0,tsi],J^Q ^) for e' small enough. In the sequel, we choose 

<£<£'< (/xJ)^x)• 

Step 1. Let us prove that (/i(")'^,/i(")'^^/n(")''^^)„6N* is tight. Let teR+ 
and n € N*. By hypothesis, we have that 

(/^^^^ i+x')+ (/^S"^'^^ i+x')+ (/^j"^'"", 1 + x') 
<(^(")'^l+x^)+(/.("■)■^^l+x^)<2A 

Thus, the sequences (//^ )neN*, (A*t )neN* and (//^ )neN* are tight 
for each t € M+ . Now by the criterion of Roelly [24] , it remains to prove that 

for each / € Sb(N), the sequence ((^uf"^'^,/), (/x^"^'^^, /), (^u^"^'^^, /))„,6n* is 
tight in P(M+,M'^). Since we have semimartingale decompositions of these 
processes, it is sufficient, by using the Rebolledo criterion, to prove that the 
finite variation part and the bracket of the martingale satisfy the Aldous 
criterion (see, e.g., [16]). We only prove that (/x ,/) is tight. For the 
other components, the computations are similar. 

The Rebolledo- Aldous criterion is satisfied if for all a > and r]> there 
exists no € N and 5 > such that for all n> uq and for all stopping times 5^ 
and Tn such that Sn <Tn < Sn + S, 

p(|^(n),is,/_^{n),is,/|^^^^^ and 
(3.12) 

P(|(mW'IS'/)^^ - (m(")'IS'/)^J > r?) < a. 
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For the finite variation part, 



E[|A 



(n),IS,/ 



On ' 



<E 



/ /3||/||oo(Mi")'^^l)d. 

V Or) 



+ E 



r„ 



fcGN j+^<fc-l 



-l)(2j + l)||/||oods 
1) is the mean number of links to l" that 



The term T,j+£<k-i3PUJ,^\' 

the newly infected individual has, given that this individual is of degree k. 

It is bounded by k. Then, with (3.4), 



E[|A 



in),lSj 

<6E\ 



A 



W,is,/| 

Sn I 



(n) ^{n)^ 



(5r+^. 







+ r 



,(/.(")'^2/ + 3x)], 



by using that the number of infectives is bounded by the size of the pop- 



(n),S, 



,(").S 



ulation and that fig (k) < fi^ (k) for all k and s > 0. From (3.10), the 
right-hand side is finite. Using Markov's inequality. 



p(|^W,IS,/ 



A 



{«),IS,/| 

•Jn ' 



>r])< 



{5r + 2f3)A6\\fl 
V 



which is smaller than a for 6 small enough. 

We use the same arguments for the bracket of the martingale 



E[|(mW'IS'/)^^-(M(")'IS'/)^^ 



(3.13) 



<E 



mf\\liSl^''^ + 4'^) , 5r||/||^(/.[,")■^x(2x + 3)2) 



+ 



n 



n 



< 



(25r + 2/3)A5||/|| 



2 

oo 



n 



using assumption (3.10). The right-hand side can be made smaller than rja 
for a small enough 5, so the second inequality of (3.12) follows again from 
Markov's inequality. By [24], this provides the tightness in 2?(M_|_, AIq ^). 

By Prohorov theorem (e.g., [11], page 104) and Step 1, the distributions 
of (;u(")'^,/i("')'^^,/i(")'^^), for n € N*, form a relatively compact family of 
bounded measures on P(IR+,A4q^), and so do the laws of the stopped 

processes (^^"|'„ , //.";'„ ^fJ-^-^n' )neN* [recah (3.9)]. Let p, := {p,^ , jl^^ , fl^^) be 
a limiting point in C(M+ , A^q a) ^^ ^^^ sequence of stopped processes and let 
us consider a subsequence again denoted by fx^'^' := (/z^"')'^, /x^")'^^, /i(")'^^)neN* 
with an abuse of notation, and that converges to p,. Because the limiting val- 
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ues are continuous, the convergence of (//'"'^)„gN* to ft holds for the uniform 
convergence on every compact subset of M+ (e.g., [6], page 112). 

Now, let us define for all t S M4. and for all bounded function / on N, 
the mappings *j'^, ^f'^ and ^f^'^ from P(M+,A^3^^) into P(IR+,IR) such 
that (1.3)-(1.5) read 



(3.14) 



{{iiif),{fifj),{firj)) 



-,SJ( 



(^,^'^(/2^A-,/2-),^r'^(/2^A-,/2^-),*^^(/2^A^^/i-)). 



,RS,/^ 



Our purpose is to prove that the limiting values are the unique solution of 
equations (1.3)-(1.5). 

Before proceeding to the proof, a remark is in order. A natural way of 
reasoning would be to prove that \I'^'-' , "^^^'J and ^^^'J are Lipschitz contin- 
uous in some spaces of measures. It turns out that this can only be done by 
considering the set of measures with moments of any order, which is a set 
too small for applications. We circumvent this difficulty by first proving that 
the mass and the first two moments of any solutions of the system are the 
same. Then, we prove that the generating functions of these measures satisfy 
a partial differential equation known to have a unique solution. 

Step 2. We now prove that the differential system (1.3)^(1.5) has at 
most one solution in C(M+,A^o,A x -^o+,A x ■M.o,a)- It is enough to prove 
the result in C{[0,T],Mo,a x Ms,a x Mq^a) for ah e > and T > 0. Let 
//* = (/2^''',//^^'*,/i^^'*), i G {1,2} be two solutions of (1.3)-(1.5) in this space, 
started with the same initial conditions. Set 
3 



^t = X] K^t'^^^) - (/^t'^ x^) 



j=o 



j=o 

Let us first remark that for all < t < T, iVf > Nj^ > e and then 



1-1,1 -1,2 1 
\Pt -Pt I 



(3.15) 



j^lSA j^lS,2 



A. 



s,i 



N, 



S,2 



= ^KAi'\x) - (Ap,x)l + -K^f '\x) - (Af '^x)l 

<4t.. 



The same computations show a similar result for \p^ — Pt \ 
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Using that /x* are solutions to (1.3)-(1.4) let us show that T satisfies 
a Gronwall inequality which implies that it is equal to for all t <T. For the 
degree distributions of the susceptible individuals, we have for p € {0, 1, 2, 3} 
and / = x^ in (1-3) 



\{f,'\xn-{fit'^xn\ 



fceN 



e^ Jo 

by using (3.15) and the fact that JIq S A^o,a- 

For /i^^ and /i^^, we use (1.4) and (1.5) with the functions / = x^ ■, f = X 
and f = x^- We proceed here with only one of the computations, others can 
be done similarly. From (1.4), 

(/if '\ 1) - (/^f '^ 1) 

Jo Jo 

Hence, with (3.15), 

\{flf''-flf'\l)\<Ci/3,r,A,e) f T,ds. 

Jo 

By analogous computations for the other quantities, we then show that 

Tt<C'{(3,r,A,e) I T,ds, 
Jo 

hence, T = 0. It follows that for all t <T, and for all j G {0, 1, 2}, 
(3.16) (/ij'\x^) = (/iP,X^) and (/zf '\x^) = (/if '^X^), 

and in particular, iVj ' = N^' and N^ ' = N^ ' . This implies that p^' = 
Pt' iPt —Pt aiidp^' =pf' . From (1.3) and the continuity of the solutions 
to (1.3)-(1.5), pathwise uniqueness holds for p,^ a.s. 

Our purpose is now to prove that p^^'^ = p^^''^. Let us introduce the fol- 
lowing generating functions: for any t G M+, i € {1, 2} and i] £ [0, 1), 

gi{v) = Y.^'pf'{k). 

k>0 
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Since we already know these measures do have the same total mass, it boils 
down to prove that Q^ = Q'^. Let us define 



j+£+m=k-l 

(3.17) 



iPsyiPsnPsrr^'^mk)ds, 






Kt = Yrkpl{k-l)p- 



The latter quantities are, respectively, of class C^ and C° with respect to 
time t and are well defined and bounded on [0,T]. Moreover, H and K 
do not depend on the chosen solution because of (3.16). Applying (1.4) to 
f{k) =7]^ yields 

Q\(3D = Q\,{ri)^H(t,ri)^ CU, Y^ {rj''-' - v'')k'flf'\k') - mir^)) ds 

= gl{r^)+H{t,v)+ f {Ks{l-v)dr,gi{v)-PGl{v))ds. 
Jo 

Then, the functions 1 1— > ^((f?) defined by QKr]) = e^^Ql{r]), i € {1,2}, are 
solutions of the following transport equation: 

(3.18) dtg{t,r]) - {1 - r,)Ktd^g{t,T]) = dtH{t,r])e^'. 

In view of the regularity of H and K, it is known that this equation admits 
a unique solution (see, e.g., [12]). Hence, GKrj) = Giiv) ^^^ ^-H ^ ^ ^+ &i^d 
rj G [0,1). The same method applies to /i^^. Thus, there is at most one 
solution to the differential system (1.3)~(1.5). 

Step 3. We now show that yu'"-* nearly satisfies (1.3)-(1.5) as n gets large. 
Recall (3.5) for a bounded function / on N. To identify the limiting values, 
we establish that for all n G N* and all i > 0, 

(3.19) (A.l;:J:J^ /) = ^^4 (/^^"^) + ^r^t + O?'^ 

where M^'^''^^'-' is defined in (3.5) and where A^:J^„ converges to when 
n -^ +00, in probability and uniformly in t on compact time intervals. 
Let us fix i € 1R+. Computations similar to (3.13) give 



(3.20) E((Mf ^''^'^)^) = E((M(")'IS,/^^) < 



(25r + 2/3)^i||/|| 



2 

00 



n 



Hence, the sequence (M^ )neN converges in L^ and in probability to 

zero (and in L^ by Cauchy-Schwarz inequality). 

We now consider the finite variation part of (3.5), given in (3.6). The sum 
in (3.6) corresponds to the links to i that the new infected individual has. 
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We separate this sum into cases where the new infected individual only has 
simple edges to other individuals of i, and cases where multiple edges exist. 
The latter term is expected to vanish for large populations: 

(3.21) ^(n),IS,/^^(n),IS,/^^(n),IS,/^ 

where 



(3.22 



Jo 

•^° ken j+e+i<k 

xU{k-{j + l + £))+ E P(^IJ + 1'/^ 



ueu{j+i,fis' ); 

VJ</? ,Ui<l 



X 



E(/(^^(^-')-^^)-/(^^(^"-'))) d^ 



j=0 



and 



•^0 feeN j+£+l<fc 

(3.23) X E p(nb- + l,Ms''^) 



«GW{i+l,M?'^^); 
3i<7^_,ni>l 

X 



E(/(^^(/^-')-^^)-/(7^(/^"-')))d«- 



We first show that Cj is a negligible term. Let q^^ ^ denote the proba- 

bility that the newly infected individual at time s has a double (or of higher 
order) edge to some alter in i"_, given j and i. The probability to have 
a multiple edge to a given infectious i is less than the number of couples of 
edges linking the newly infected to i, times the probability that these two 
particular edges linking i to a susceptible alter at s_ actually lead to the 
newly infected. Hence, 



nGW(i+l,M?'^^); 



n,IS) 
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< 



j\ V- 4(sL)(4(sL)-i) 



E 



(3.24) 






< 






2y ne(e — 1/n) 
Then, since for all u € U{j + 1, ^u"' ), 



(3.25) 



if s < r" and n > 1/e. 



< 2(j + 1 



we have by (3.24) and (3.25), for n > 1/e, 

\ci:lf'^\< j;rA:/x(")'«(fc) ^ p«(j,£|fc - l)2(j + 1)||/||, 



fcGN 



i+£+l<fc 



(3.26) 



2ne(e — 1/n) 



< 



^r-t||/||oo (n,),s 4 

\A'0 1 A / ; 



ne(e — 1/n) 

which tends to zero in view of (3.10) and to the fact that fi" is dominated 

by ;u{,"^'^ for ah s > and neW. 

We now aim at proving that B^l'^ is somewhat close to "^.^^„{fi^"''). 
First, notice that 

,n,IS\ 



Wi<IS' ,Ui<l 



nGW(i+l,M?'^^); 



jn 



E(/(^^(^"-')-^^)-/(^^(^-'))) 



i=l 



E 



Me{ij_)J+i 



1 1a:=0 ^Mfc (^5 . 



N. 



n,IS 



(iV-.-_(j + i)) 



E(/('^-"^(^"-)-i)-/(^«'"(sL))) 



m=0 
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i / rri 



^n.^rrn v,+iViv.: ••■(iv.: -(j + i))y 



m=OMe{ij_)j+i 



x(/K™(sL)-i)-/K„(sL))) 
E E -r^(/(rf^(sL)-i)-/(rf.(sL))) 



m=0 ^x6IJ_ 



, ^, ,, , (ivr-''^ -!)••• (A^r'^ - (j + 1)) 

where we recall that Tif{k) = f{k — 1) for every function / on N and /c G N. 
In the third equality, we split the term Um from the other terms {umi)m'^m- 
The last sum in the right-hand side of this equality is the probability of 
drawing j different infectious individuals that are not Um and that are all 
different, hence, 1 — g"_i „ „. 
Denote for t > and n G N, 

n,i _ (/^r'^^^x)-! 

Pt / n,S \ 1 ' 

/ n,RS \ 
n,,R ^ (^f , X) 
Pt / n,S \ 1 ' 

(/^i ,x)-i 

/ n.S \ / n,IS \ / n,RS \ 
n,S _ (/^f ,X)-{lJ-t ,X)-{l^t ^ X) 
"t / n,S \ 1 ' 

the proportion of edges with infectious (resp., removed and susceptible) 
alters and susceptible egos among all the edges with susceptible egos but 
the contaminating edge. For all integers j and i such that j + i <k — 1 and 
n € N* , denote by 

the probability that the multinomial variable counting the number of edges 
with infectious, removed and susceptible alters, among k — 1 given edges, 
equals {j, i,k — 1 — j — I). We have that 
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where 



■^0 fcGN 



X J2 {P^iJJ\k-l)-p:{jJ\k-l)) 
j+e+i<k 



x(/(A:-(j + £+l)) 

, ^. , -|^ (4-^'^^x(n/-/)) \ ^ 

+ U + 1) -7^TT5 ds, 



(n),IS 



Jo ,,^„,, 



° fceN 



X Y. PlUJ\k-l){j + l) 
j+e+i<k 



(3.29) iA^;:if% P"E^^<(^^)ii/ii-fi+^)/^i"^''(^)d^ 

Jo t^ V e / 



First, 

r-tArP 

fcGN 

where for all A; € N 

j+i+l<k 

The multinomial probability p^{j,i\k — 1) approximates the hypergeometric 
one, p"{j,^k — l,s), as n increases to infinity, in view of the fact that the 
total population size, (/Iq' ,1) + (//q' ,1), is of order n. Hence, the right- 
hand side of (3.29) vanishes by dominated convergence. 
On the other hand, using (3.24), 

" Jo ^^ r's \ J 2ne(e - In) 

(3.30) 



ne^{e — 1/n) ' 

in view of (3.10). Gathering (3.20), (3.21), (3.26), (3.28), (3.29) and (3.30) 
concludes the proof that the rest of (3.19) vanishes in probability uniformly 
over compact intervals. 
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Step 4. Recall that in this proof, fi = {fL^ , p}^ , fi^^) is the limit of fi'^ln = 
(//."!'„, /x"|n j/iYsT-'n )neN*, and recall that these processes take values in 
the closed set A^q^- Our purpose is now to prove that fl satisfy (1.3)~(1.5). 
Using the Skorokhod representation theorem, there exists, on the same prob- 
ability space as /i, a sequence, again denoted by (/i."|,i)neN* with an abuse 
of notation, with the same marginal distributions as the original sequence, 
and that converges a.s. to jl. 

The maps u. := {u\u^,u^) ^ {u\ 1) / {{u^ , 1) + {i^ll) + (z^^^l)) [resp., 
(1.2, l)/((z/i, 1) + {'yl 1) + Wl 1)) and {u^ l)/((^o'> 1) + (^o> 1)) + (^0, 1)1 are 
continuous from C(M+,7Wo,A x Me,A x Mq^a) into C(R+,M). 

Then, Lemma A. 5 together with the continuity of (X.^,X.2) i-^ X^/X"^ 
from C(M+,M) x C(M+,M*) into C(M+,M) (see, e.g., [28]), implies that the 
mapping u. i-^ (i^.^,x)/(^.^)X) is continuous fromC(M_|_, A4o,a x -M-s^ax -Mq^a) 
into C(M4.,IR). The same argument yields the continuity of v. i-?- 1/^,1 w^/ 

{v'^-.x) for the same spaces. 

Lemma A. 5 also provides the continuity of v. i— ?> {i^'^,x{Tif ~ /)) from 
C(M+,A^o,A X Me^A X Mo,a) into C(]R+,M) for bounded function / on N. 

Since, as well known, the mapping y G D([0,t],]R) i— )■ f^ t/sds is continu- 
ous, we have proven the continuity of the mapping ^( defined in (3.14) on 

V{R+,Mo,AXMe,AXMo,A)- 

By Lemma A.S applied to ip = x-, the process {N J^^U )nGN* converges in 
distribution to N^^ = {fi}^,x)- Since the latter process is continuous, the 
convergence holds in (I?([0,T],M+), || • ||oo) for any T > (see [6], page 112). 
As y S P(M+,R) I— 7- infjgro^T] uit) G IR is continuous, we have a.s. that 

inf iVf = lim inf N^^lf {>e). 

te[0,T] n->+oo46[o,T] ^ 

We consider t^/ = inf{t € R^jN^ < e'}- A difficulty lies in the fact that we 
do not yet know whether this time is deterministic. We have a.s. 

(3.31) e'< inf Nj^r = lim inf A^/^^f^r . 

Hence, using Fatou's lemma, 

1 = P(' inf iVf >e 

VtG[0,t^/] 

(3.32) < lim Pf inf N^^^lf > e 
= lim P(r">rAV). 

n— >-+oo 

Hence, we have 
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From the estimates of the different terms in (3.19), ^^^^„^j,{fi^ '>) is upper 

bounded by a moment of /.f'") of order 4. In view of (3.10) and (3.32), the 
first term in the right-hand side converges in L^ and hence, in probabil- 
ity, to zero. Using the continuity of ^^^'-f on P(M+,7\4o,A x Me,A x A^o,a), 
^^^'■' (a* At") converges to ^^^'-^(/i) and, therefore, ^^^ ^^^(/x "|n) converges 

TO J" 

to ^.a'/ at(^)' Thanks to this and (3.32), the second term in the right-hand 

TO J" 

side converges to ^.// /\t(P') ™ I?(M_|_,M). 

Then, ((/^i "^nAi^,AT' /) ~ ^^A4AiVAT(/^^"^))"GN* converges in probabihty 

TO J" 

to (a. At,/ AT)/) ~ ^^-At AT^^)' -f'^o™ (3-19), this Sequence also converges in 
probability to zero. By identification of these limits, fi^^ solves (1.4) on 
[0,t£/ AT]. If (/io^,x) > then similar techniques can be used. Else, the 
result is obvious since for all t G [0,te' A T], (/ij" ,%) > e and the term 
p't{j,^\k — 1) is negligible when i > 0. Thus fl coincides a.s. with the only 
continuous deterministic solution of (1.3)~(1.5) on [0,i^' AT]. This implies 

that is' = t^i and yields the convergence in probability of (// AT")nGN* to /i, 
uniformly on [O,^^'] since /2 is continuous. 

We finally prove that the nonlocalized sequence (^*'"^)„gN* also converges 
uniformly and in probability to fl in P([0,te'], A^o.A x ■Me,A x ■Mq,a)- For 
a small positive rj, 

p( sup |(A4")'^^/)-M/f'^(A)|>7?) 

(3.33) <P( sup |vI/f^'4(^W)-^f'^(/Z)|>^;r;>v) 

^te[o,t^ 

Using the continuity of '^^ and the uniform convergence in probability pro- 
ved above, the first term in the right-hand side of (3.33) converges to zero. 
We can show that the second term converges to zero by using Doob's inequal- 
ity together with the estimates of the bracket of M^"^'^^'-* [similar to (3.13)] 
and of A"'-' (Step 2). Finally, the third term vanishes in view of (3.32). 

The convergence of the original sequence {ijS'^')n£n* is then entailed by 
the uniqueness of the solution to (1.3)-(1.5), implied by Step 2. 

Step 5. When n — > +oo, by taking the limit in (3.1), {ijy^''^)n£W converges 
in X'(M_|_, A^o,a) to the solution of the following transport equation, that can 
be solved in function of p^. For every bounded function / : (fc,t) i— >■ ft{k) € 
Cj' (N X M_|_,]R) of class C^ with bounded derivative with respect to t, 

(3.34) (A?, ft) = ifllfo) - I {plrxPlfs - dsfs) ds. 

Jo 



+ P( sup \A^;,[.+MiZ'r\>j]+nr^<ts'). 
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Choosing f{k,s) = ip{k) exp{—rk j^ ^p^{u)du), we obtain that 

(3.35) {fil^) = Y,y^{k)e'^fil{k), 

kern 

where 9t = exp(— r L p^{u) du) is the probabihty that a given degree 1 node 
remains susceptible at time t. This is the announced equation (1.3). D 

We end this section with a lower bound of the time t^i until which we 
proved that the convergence to Volz's equations holds. 

Proposition 3.3. Under the assumptions of Theorem 1, 



(3.36) te> > V 



max(/3,r) 



Proof. Because of the moment assumption (3.10), we can prove 
that (3.19) also holds for f = x- This is obtained by replacing in (3.20), 
(3.26), (3.29) and (3.30) ||/||oo by k and using the assumption of bound- 
edness of the moments of order 5 in (3.26) and (3.30). This shows that 
(A^^"^'^^)ri,GN converges, uniformly on [0, i^/] and in probability, to the de- 
terministic and continuous solution N^^ = {p}^ , x) ■ We introduce the event 
Ac = {| Nq' — uNq^ \< ^} where their differences are bounded by ^ > 0. 
Recall the definition (3.9) and let us introduce the number of edges Z" that 
were IS at time and that have been removed before t. For t>T^,, we have 
necessarily that Z" > Nq' — ne' . Thus, 

IP({tP <t}n A^) < P({Zj" > iVo"''" - ne'} n A^) 
(3.37) 

When susceptible (resp., infectious) individuals of degree k are contaminated 
(resp., removed), at most k is-edges are lost. Let X"' be the number of edges 
that, at time 0, are IS with susceptible alter of degree k, and that have 
transmitted the disease before time t. Let Y^' be the number of initially 
infectious individuals x with dx{So) = k and who have been removed before 
time t. X^' ' and Y^' ' are bounded by kfi^' (k) and ^q' (k). Thus, 

(3.38) Zf < Y^ k{X'^''' + y^^''^). 

k&i 

Let us stochastically upper bound Z". Since each is-edge transmits the dis- 
ease independently at rate r, X^' is stochastically dominated by a binomial 
r.v. of parameters A;/Xq' {k) and 1 — e~^*. We proceed similarly for Y^' . 
Conditionally to the initial condition, X"' + Y^' is thus stochastically 
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dominated by a binomial r.v. Z"' of parameters (kfi^' (k) + /ig' (k)) and 
1 - e-°^^^(^'^)*. Then (3.37) and (3.38) give 

(3.39) P({tP < t} n ^^) < Pf ^ —^ > iV^^ - e' - - J . 



Thanks to Assumption 3.1 and (3.10), the series X^fceN^-^"' /''^ converges 
in L^ and hence, in probabihty to ((/^qiX^) + ^o^)(l — g-max(/3,r)t-j ^i^^^ 
n —7- +00. Thus, for sufficiently large n, 

P({r" < t} n ^^) = 1 if t > V and if t < v. 

For all t < r^/, it follows from Assumption 3.1, (3.10) and Lemma A. 4 that 

lim P(r,^<t)< lim (P({tP < t} n >1?) + P((^?)'=)) = 0, 

SO that by Theorem 1 

1 = lim P(r!] >f^>)= lim Pf" inf iV^")'^*^ > e') = Pf inf iVf > e') . 

n->-+oo n->-+oo \i<T^/ / \t<T^i J 

This shows that t^/ > f^' a.s., which concludes the proof. D 

3.3. Proof of Volz's equations. 

Proposition 3.4. The system (1.3)-(1.5) implies Volz's equations (1.8)- 
(1.11). 

Before proving Proposition 3.4, we begin with a corollary of Theorem 1. 

Corollary 3.5. For all t e M+ 
Nf = 0t9'{et), 

(3.40) Nf = N^' + 1^ rp%g'{9s) ((p1 - pDe^ ^ - l) - /3iVf d., 

ivr = fmi' - rpfpleyies)) ds. 

Jo 

Proof. In the proof of Proposition 3.3, we have shown that (A^^"^'^^)„gN 
converges uniformly on compact intervals and in probability to the deter- 
ministic and continuous solution N^^ = (/i^^,x)- Equation (1.3) with f = x 
reads 

(3.41) Nf = Y,fimk9^ = 9tY,fiUk)k9t' = 9tg'{et), 

fceN k=l 

that is, the first assertion of (3.40). 
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Choosing f = x iii (1-4), we obtain 



iVf = N, 



Jo Jo f~r, , , TTT , 



^ -^O fcGN j+e<k-l 



MyiPfYiPtr 



/Z^(A:)ds. 



[j\{k - 1 - j - i)w: 

Notice that the term in the square brackets is the probabihty to obtain 
(j, i,k—l—j—£) from a draw in the multinomial distribution of parameters 
{k-1, {Pl,pf,p^s))- Hence, 

as we recognize the mean number of edges to i^ of an individual of degree k. 
Other terms are treated similarly. Hence, with the definition of Xs{k), (2.1), 



JO 

t 
(3Nf ds. 





But since 

fceN 
iPlx' - 2x) = iPlxix - 1)) - {Pf,x) = eh"i0t) - 0t9'i9t), 
we obtain by noticing that 1 — 2pl — p^ =p^ ~ PI^ 

(3.42) TVf = 7V^s+ / rpliipt-pl)9h"{es)-e,g'{es))ds- [ /3iVf ds, 
Jo Jo 

which is the second assertion of (3.40). The third equation of (3.40) is ob- 
tained similarly. D 

We are now ready to prove Volz's equations. 

Proof of Proposition 3.4. We begin with the proof of (1.8) and (1.9). 
Fix again t > 0. For the size of the susceptible population, taking c/? = 1 
in (1.3), we are led to introduce the same quantity 9t = exp(— r L p^ds) as 
Volz and obtain (1.8). For the size of the infective population, setting / = 1 
in (1.4) entails 

ft 



h = h+ I (y^rkp\jll{k) - (3Is) ds 
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= Io+ t (rpl j; fimke's - pis) ds 
■^0 ^ ken ^ 

' {rp%g'{9s)-f3ls)ds 

'0 

by using (1.3) with / = x for the second equahty. 

Let us now consider the probabihty that an edge with a susceptible ego has 
an infectious alter. Both equations (1.8) and (1.9) depend on p] = iV/^/iV^f. 
It is thus important to obtain an equation for this quantity. In [27], this 
equation also leads to introduce the quantity p^. 

From Corollary 3.5, we see that N^ and N^^ are differentiable and 

~dt ~ di\N[ ) ~ Nf di^ * ^~(iVf)2dt^ *^ 



a"(0 ) 
rpliPl - PlWt-f7^ - rp\ - Ppl 



Otg' 



^* {-rpletg'{et) + 9tg"{et){-rplet)) 



H 



= rplplet^-rplil-pl)-/3p] 

by using the equations 1 and 2 of (3.40) for the derivatives of N^ and N^^ 
with respect to time for the second line. This achieves the proof of (1.10). 

For (1.11), we notice that pf = ^—pi—pf and achieve the proof by showing 
that 

(3.43) pf= f {I3pl-rplpf)ds 

Jo 

by using arguments similar as for p]. D 

Remark 2. Miller [18] shows that Volz's equations can be reduced to 
only three ODEs: 

St = giOt), -7^ = pit, It = iSo + Io)-St-Rt, 
at 

^^*- -r0, + /3(l-e,)+/3-^'^^*) 



dt ^ ' "^ '' ' ^ g'{l) 

The last ODE is obtained by considering the probability that an edge with 
an infectious ego drawn at random has not transmitted the disease. How- 
ever, in his simplifications, he uses that the degree distributions Pq/Sq and 
"^keNPk^k are the same, which is not necessarily the case (see our Re- 
mark 1). Moreover, it is more natural to have an ODE on It and Nj^ is 
a natural quantity that is of interest in itself for the dynamics. 
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APPENDIX: FINITE MEASURES ON N 

First, some notation is needed in order to clarify the way the atoms of 
a given element of A4i?(N) are ranked. For all n € ^^^^(N), let F^ be its 
cumulative distribution function and F~^ be its right inverse defined as 

(A.l) VxGM+ F~^{x) = mi{ieN,Ff,{i)>x}. 

Let /i = XlnGN'^™'^" ^^ ^^ integer-valued measure of A1i;'(N), that is, such 
that the a„'s are integers themselves. Then, for each atom n € N of ^ such 
that On > 0, we duplicate the atom n with multiplicity an, and we rank the 
atoms of fi by increasing values, sorting arbitrarily the atoms having the 
same value. Then, we denote for any i < (/i, 1), 

(A.2) j,{^,) = F-\i), 

the level of the ith atom of the measure, when ranked as described above. 
We refer to Example 1 for a simple illustration. 

We now make precise a few topological properties of spaces of measures 
and measure-valued processes. For T > and a Polish space {E,dE), we 
denote by T>{[0,T],E) the Skorokhod space of cadlag (right-continuous left- 
limited) functions from M to £^ (e.g., [6, 16]) equipped with the Skorokhod 
topology induced by the metric 



dT(f,g) := inf < sup 

aGA([0,T])[(,^t)g[0,7.] 



(A.3) 



2 



a(s) — a(t) 
log : 



+ supds(/(t),g(a(t))) 



t<T 



where the infimum is taken over the set A([0,T]) of continuous increasing 
functions a : [0, T] -^ [0, T] such that a(0) = and a{T) = T. 

Limit theorems are heavily dependent on the topologies considered. We 
introduce here several technical lemmas on the space of measures related 
to these questions. For any fixed < e < A, recall the definition of A4s,a 
in (3.7). Remark that for any z/ € Ms^a, and i G {0, . . . , 5}, (z/, x*) < A since 
the support of i^ is included in N. 

Lemma A.l. Let 3 a set and a family {ut^'t £ 3) of elements of Ais,A- 
Then, for any real function / on N such that f{k) = o{k^), we have that 

lim sup|(i^^,/l[;^ ))| = 0. 

Proof. By the Markov inequality, for any r € 3, for any K, we have 
Y, \fmMk)< A sup ^-^, 

k>K ^^^ ^ 
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hence, 

lim sup\{ur,f)\<Al[msup^-^^ = 0. 

The proof is thus complete. D 

Lemma A. 2. For any ^4 > 0, the set Me, a is a closed subset of Mf(^) 
embedded with the topology of weak convergence. 

Proof. Let (/i„)„gM be a sequence of M.e,A converging to /.i G 7V4i?(N) 
for the weak topology, which implies in particular that linin^^ao fJ-nik) = 
n{k) for any /c € N. Denoting for all n and A; € N, fn{k) = k^fi'^{k), we have 
that lim„_j.+oo/n(^) = f{k) := k^fi{k), /i-a.e., and Fatou's lemma implies 

(/i,X^)= V/(/c)<lim inf V/„(/c)=lim inf (/i„,x^>- 

^— ' n— ^oo ^—^ n— >oo 

fcGN fceN 

Since {fin, 1) tends to (/i, 1), we have that (^, 1 + x^) < A. 

Furthermore, by uniform integrability (Lemma A.l), it is also clear that 

e< lim {l^n,x) = {l^,x), 

which shows that n € Me,a- D 

Lemma A. 3. The traces on Me, a of the total variation topology and of 
the weak topology coincide. 

Proof. It is well known that the total variation topology is coarser 
than the weak topology. In the reverse direction, assume that (^n)neN is 
a sequence of weakly converging measures belonging to Me,A- Since 

fceN 

according to Lemma A.l, it is then easily deduced that the right-hand side 
converges to as n goes to infinity. D 

Lemma A. 4. If the sequence {fin)neN of M^ j^ converges weakly to the 
measure fiGMe,A, then {{fJ-n, f))neN converges to {fi, f) for all function f 
such that f{k) = o{k^) for all large k. 

Proof. Triangular inequality says that 

|(Mn,/> - (/^,/)| < K^nJlfO.A-]) - (A*,/1[0,A-])I 

+ I (M, /l(i^,+oo)) I + I (/^n, /l{i^,+oo)) I • 

We then conclude by uniform integrability and weak convergence. D 
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Recall that Me, A can be embedded with the total variation distance topol- 
ogy, hence, the topology on 'D{[0,T],^As,A) is induced by the distance 

a{s) — a{t) 



Pt{h-,i^-)= inf sup 

aGA([0,T])V(s,i)e[0,T]2^ 



log- 



s-t 



+ sup (Ity {fit, l^a(t)) 
t<T 



Lemma A. 5. For any p<5, the following map is continuous: 
. (V(R+,Me,A)^1>{R+,R), 

Proof. It is sufficient to prove the continuity of the above mappings 
from V{[0,T],Ms,a) to P([0,r],M), for any T > 0, where the latter are 
equipped with the corresponding Skorokhod topologies. For fx and v two 
elements of A4e,A, for any p < 5, for any positive integer K, according to 
the Markov inequality, 

|(/i,X^)-^,X^)l<2^ + |(/i-z.,x^l[o,^]>| 
(A.4) ^ 

Using (A. 3) and (A.4) we have for any K > 0, 

and hence, the continuity of <l>p. D 
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